AITopics

Technology: Information Technology > Artificial Intelligence (1.00)

Neural Information Processing SystemsDec-25-2025, 11:22:14 GMT

Is Deeper Better only when Shallow is Good?

Understanding the power of depth in feed-forward neural networks is an ongoing challenge in the field of deep learning theory. While current works account for the importance of depth for the expressive power of neural-networks, it remains an open question whether these benefits are exploited during a gradient-based optimization process. In this work we explore the relation between expressivity properties of deep networks and the ability to train them efficiently using gradient-based algorithms. We give a depth separation argument for distributions with fractal structure, showing that they can be expressed efficiently by deep networks, but not with shallow ones. These distributions have a natural coarse-to-fine structure, and we show that the balance between the coarse and fine details has a crucial effect on whether the optimization process is likely to succeed. We prove that when the distribution is concentrated on the fine details, gradient-based algorithms are likely to fail. Using this result we prove that, at least in some distributions, the success of learning deep networks depends on whether the distribution can be approximated by shallower networks, and we conjecture that this property holds in general.

deeper better only, name change, proceedings, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsAug-19-2025, 07:42:30 GMT

Supplementary Material for " FreGAN: Exploiting Frequency Components for Training GANs under Limited Data " Mengping Y ang Zhe Wang Ziqiu Chi Y anbing Zhang

Our FreGAN improves the overall quality of generated images and raises the model's frequency awareness, encouraging the generator to produce precise

artificial intelligence, fregan, machine learning, (15 more...)

Country:

Asia > China (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.96)

Neural Information Processing SystemsAug-15-2025, 10:15:40 GMT

Figure 5: Average test SSIM in various scenarios, across a range of acceleration factors

Shaded regions indicate 95% confidence intervals.

acceleration factor, nullnull, reconstruction, (16 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Diagnostic Medicine (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

arXiv.org Artificial IntelligenceJul-30-2025

APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing

Han, Sangmin, Jeong, Jinho, Kim, Jinwoo, Kim, Seon Joo

Latent Diffusion Models (LDMs) are generally trained at fixed resolutions, limiting their capability when scaling up to high-resolution images. While training-based approaches address this limitation by training on high-resolution datasets, they require large amounts of data and considerable computational resources, making them less practical. Consequently, training-free methods, particularly patch-based approaches, have become a popular alternative. These methods divide an image into patches and fuse the denoising paths of each patch, showing strong performance on high-resolution generation. However, we observe two critical issues for patch-based approaches, which we call "patch-level distribution shift" and "increased patch monotonicity. " T o address these issues, we propose Adaptive Path Tracing (APT), a framework that combines Statistical Matching to ensure patch distributions remain consistent in upsampled latents and Scale-aware Scheduling to deal with the patch monotonicity. As a result, APT produces clearer and more refined details in high-resolution images. In addition, APT enables a shortcut denoising process, resulting in faster sampling with minimal quality degradation. Our experimental results confirm that APT produces more detailed outputs with improved inference speed, providing a practical approach to high-resolution image generation. 1

artificial intelligence, machine learning, resolution, (16 more...)

2507.2169

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)

Vontobel, Tobias, Sadat, Seyedmorteza, Salehi, Farnood, Weber, Romann M.

HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

arXiv.org Artificial IntelligenceJun-26-2025

Diffusion models have emerged as the leading approach for image synthesis, demonstrating exceptional photorealism and diversity. However, training diffusion models at high resolutions remains computationally prohibitive, and existing zero-shot generation techniques for synthesizing images beyond training resolutions often produce artifacts, including object duplication and spatial incoherence. In this paper, we introduce HiWave, a training-free, zero-shot approach that substantially enhances visual fidelity and structural coherence in ultra-high-resolution image synthesis using pretrained diffusion models. Our method employs a two-stage pipeline: generating a base image from the pretrained model followed by a patch-wise DDIM inversion step and a novel wavelet-based detail enhancer module. Specifically, we first utilize inversion methods to derive initial noise vectors that preserve global coherence from the base image. Subsequently, during sampling, our wavelet-domain detail enhancer retains low-frequency components from the base image to ensure structural consistency, while selectively guiding high-frequency components to enrich fine details and textures. Extensive evaluations using Stable Diffusion XL demonstrate that HiWave effectively mitigates common visual artifacts seen in prior methods, achieving superior perceptual quality. A user study confirmed HiWave's performance, where it was preferred over the state-of-the-art alternative in more than 80% of comparisons, highlighting its effectiveness for high-quality, ultra-high-resolution image synthesis without requiring retraining or architectural modifications.

artificial intelligence, machine learning, resolution, (15 more...)

2506.20452

Country:

North America (0.46)
Europe > Switzerland (0.15)

Genre: Research Report > New Finding (0.66)

Industry:

Media (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

PCWorldMay-14-2025, 17:00:00 GMT

New Winxvideo AI – One-stop Video/Image Enhancer & Toolkit

We seem to have more video footage and still images than ever before, thanks to smartphones, GoPro cameras and the backlog of older ones collected across a lifetime. Managing all these formats, as well as making sure they look their best, can be a frightening proposition. Thankfully, Winxvideo AI is a powerful all-in-one solution that not only uses advanced Artificial Intelligence software to upgrade the quality of your content but can rescue old photos and footage too. The newly updated version 4.0 also brings huge improvements to speed, plus a special price offer, so you can save both time and money while you upgrade your photo and video library. Winxvideo AI comes with an impressive array of features that can turn tired, old, blurry videos into something far more professional.

artificial intelligence, video image enhancer & toolkit, winxvideo ai, (11 more...)

PCWorld

Industry: Media > Photography (0.72)

Technology: Information Technology > Artificial Intelligence (1.00)

Lee, Jee Won, Lim, Hansol, Yang, SooYeun, Choi, Jongseong Brad

MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection

arXiv.org Artificial IntelligenceMar-24-2025

This paper presents a novel masked attention-based 3D Gaussian Splatting (3DGS) approach to enhance robotic perception and object detection in industrial and smart factory environments. U2-Net is employed for background removal to isolate target objects from raw images, thereby minimizing clutter and ensuring that the model processes only relevant data. Additionally, a Sobel filter-based attention mechanism is integrated into the 3DGS framework to enhance fine details - capturing critical features such as screws, wires, and intricate textures essential for high-precision tasks. We validate our approach using quantitative metrics, including L1 loss, SSIM, PSNR, comparing the performance of the background-removed and attention-incorporated 3DGS model against the ground truth images and the original 3DGS training baseline. The results demonstrate significant improves in visual fidelity and detail preservation, highlighting the effectiveness of our method in enhancing robotic vision for object recognition and manipulation in complex industrial settings.

artificial intelligence, fine detail, reconstruction, (14 more...)

2503.1933

Country:

Asia > China > Zhejiang Province > Hangzhou (0.05)
North America > United States > New York > Suffolk County > Stony Brook (0.05)
Europe > Switzerland (0.05)
(2 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

arXiv.org Artificial IntelligenceFeb-21-2025

Super-Resolution for Interferometric Imaging: Model Comparisons and Performance Analysis

Abdioglu, Hasan Berkay, Gursoy, Rana, Isik, Yagmur, Balci, Ibrahim Cem, Unal, Taha, Bayer, Kerem, Inal, Mustafa Ismail, Serin, Nehir, Kosar, Muhammed Furkan, Esmer, Gokhan Bora, Uvet, Huseyin

This study investigates the application of Super-Resolution techniques in holographic microscopy to enhance quantitative phase imaging. An off-axis Mach-Zehnder interferometric setup was employed to capture interferograms. The study evaluates two Super-Resolution models, RCAN and Real-ESRGAN, for their effectiveness in reconstructing high-resolution interferograms from a microparticle-based dataset. The models were assessed using two primary approaches: image-based analysis for structural detail enhancement and morphological evaluation for maintaining sample integrity and phase map accuracy. The results demonstrate that RCAN achieves superior numerical precision, making it ideal for applications requiring highly accurate phase map reconstruction, while Real-ESRGAN enhances visual quality and structural coherence, making it suitable for visualization-focused applications. This study highlights the potential of Super-Resolution models in overcoming diffraction-imposed resolution limitations in holographic microscopy, opening the way for improved imaging techniques in biomedical diagnostics, materials science, and other high-precision fields.

interferometric imaging, real-esrgan, super-resolution, (14 more...)